Dataset statistics
| Number of variables | 29 |
|---|---|
| Number of observations | 3756 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 851.1 KiB |
| Average record size in memory | 232.0 B |
Variable types
| Numeric | 17 |
|---|---|
| Categorical | 12 |
director_name has a high cardinality: 1659 distinct values | High cardinality |
actor_2_name has a high cardinality: 2188 distinct values | High cardinality |
genres has a high cardinality: 745 distinct values | High cardinality |
actor_1_name has a high cardinality: 1428 distinct values | High cardinality |
movie_title has a high cardinality: 3655 distinct values | High cardinality |
actor_3_name has a high cardinality: 2587 distinct values | High cardinality |
plot_keywords has a high cardinality: 3656 distinct values | High cardinality |
movie_imdb_link has a high cardinality: 3656 distinct values | High cardinality |
df_index is highly correlated with gross and 1 other fields | High correlation |
num_critic_for_reviews is highly correlated with num_voted_users and 2 other fields | High correlation |
actor_3_facebook_likes is highly correlated with actor_1_facebook_likes and 2 other fields | High correlation |
actor_1_facebook_likes is highly correlated with actor_3_facebook_likes and 2 other fields | High correlation |
gross is highly correlated with df_index and 3 other fields | High correlation |
num_voted_users is highly correlated with num_critic_for_reviews and 2 other fields | High correlation |
cast_total_facebook_likes is highly correlated with actor_3_facebook_likes and 2 other fields | High correlation |
num_user_for_reviews is highly correlated with num_critic_for_reviews and 2 other fields | High correlation |
budget is highly correlated with df_index and 1 other fields | High correlation |
title_year is highly correlated with num_critic_for_reviews | High correlation |
actor_2_facebook_likes is highly correlated with actor_3_facebook_likes and 2 other fields | High correlation |
num_critic_for_reviews is highly correlated with num_voted_users and 2 other fields | High correlation |
actor_3_facebook_likes is highly correlated with actor_2_facebook_likes | High correlation |
actor_1_facebook_likes is highly correlated with cast_total_facebook_likes | High correlation |
gross is highly correlated with num_voted_users and 1 other fields | High correlation |
num_voted_users is highly correlated with num_critic_for_reviews and 3 other fields | High correlation |
cast_total_facebook_likes is highly correlated with actor_1_facebook_likes and 1 other fields | High correlation |
num_user_for_reviews is highly correlated with num_critic_for_reviews and 2 other fields | High correlation |
actor_2_facebook_likes is highly correlated with actor_3_facebook_likes and 1 other fields | High correlation |
movie_facebook_likes is highly correlated with num_critic_for_reviews and 1 other fields | High correlation |
df_index is highly correlated with budget | High correlation |
num_critic_for_reviews is highly correlated with num_voted_users and 1 other fields | High correlation |
actor_3_facebook_likes is highly correlated with cast_total_facebook_likes and 1 other fields | High correlation |
actor_1_facebook_likes is highly correlated with cast_total_facebook_likes and 1 other fields | High correlation |
num_voted_users is highly correlated with num_critic_for_reviews and 1 other fields | High correlation |
cast_total_facebook_likes is highly correlated with actor_3_facebook_likes and 2 other fields | High correlation |
num_user_for_reviews is highly correlated with num_critic_for_reviews and 1 other fields | High correlation |
budget is highly correlated with df_index | High correlation |
actor_2_facebook_likes is highly correlated with actor_3_facebook_likes and 2 other fields | High correlation |
language is highly correlated with country | High correlation |
country is highly correlated with language | High correlation |
df_index is highly correlated with gross | High correlation |
num_critic_for_reviews is highly correlated with gross and 4 other fields | High correlation |
duration is highly correlated with country | High correlation |
director_facebook_likes is highly correlated with language | High correlation |
actor_3_facebook_likes is highly correlated with gross and 3 other fields | High correlation |
actor_1_facebook_likes is highly correlated with cast_total_facebook_likes | High correlation |
gross is highly correlated with df_index and 4 other fields | High correlation |
num_voted_users is highly correlated with num_critic_for_reviews and 5 other fields | High correlation |
cast_total_facebook_likes is highly correlated with actor_3_facebook_likes and 2 other fields | High correlation |
num_user_for_reviews is highly correlated with num_critic_for_reviews and 3 other fields | High correlation |
language is highly correlated with director_facebook_likes and 2 other fields | High correlation |
country is highly correlated with duration and 2 other fields | High correlation |
content_rating is highly correlated with title_year | High correlation |
budget is highly correlated with language and 1 other fields | High correlation |
title_year is highly correlated with num_critic_for_reviews and 1 other fields | High correlation |
actor_2_facebook_likes is highly correlated with actor_3_facebook_likes and 1 other fields | High correlation |
imdb_score is highly correlated with num_voted_users and 1 other fields | High correlation |
movie_facebook_likes is highly correlated with num_critic_for_reviews and 1 other fields | High correlation |
actor_1_facebook_likes is highly skewed (γ1 = 20.3394708) | Skewed |
budget is highly skewed (γ1 = 44.17414414) | Skewed |
movie_title is uniformly distributed | Uniform |
actor_3_name is uniformly distributed | Uniform |
plot_keywords is uniformly distributed | Uniform |
movie_imdb_link is uniformly distributed | Uniform |
df_index has unique values | Unique |
director_facebook_likes has 642 (17.1%) zeros | Zeros |
facenumber_in_poster has 1582 (42.1%) zeros | Zeros |
movie_facebook_likes has 1742 (46.4%) zeros | Zeros |
Reproduction
| Analysis started | 2022-05-11 15:02:11.244909 |
|---|---|
| Analysis finished | 2022-05-11 15:03:56.565576 |
| Duration | 1 minute and 45.32 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 3756 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2139.449414 |
| Minimum | 0 |
|---|---|
| Maximum | 5042 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 193.75 |
| Q1 | 989.75 |
| median | 2033.5 |
| Q3 | 3163.25 |
| 95-th percentile | 4538.25 |
| Maximum | 5042 |
| Range | 5042 |
| Interquartile range (IQR) | 2173.5 |
Descriptive statistics
| Standard deviation | 1345.761978 |
|---|---|
| Coefficient of variation (CV) | 0.6290225743 |
| Kurtosis | -0.9680602618 |
| Mean | 2139.449414 |
| Median Absolute Deviation (MAD) | 1085.5 |
| Skewness | 0.2735491187 |
| Sum | 8035772 |
| Variance | 1811075.302 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 2787 | 1 | < 0.1% |
| 2771 | 1 | < 0.1% |
| 2773 | 1 | < 0.1% |
| 2774 | 1 | < 0.1% |
| 2776 | 1 | < 0.1% |
| 2777 | 1 | < 0.1% |
| 2779 | 1 | < 0.1% |
| 2780 | 1 | < 0.1% |
| 2781 | 1 | < 0.1% |
| Other values (3746) | 3746 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 5042 | 1 | |
| 5035 | 1 | |
| 5033 | 1 | |
| 5027 | 1 | |
| 5026 | 1 | |
| 5025 | 1 | |
| 5015 | 1 | |
| 5012 | 1 | |
| 5011 | 1 | |
| 5008 | 1 |
color
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.5 KiB |
| Color | |
|---|---|
| Black and White | 124 |
Length
| Max length | 16 |
|---|---|
| Median length | 5 |
| Mean length | 5.36315229 |
| Min length | 5 |
Characters and Unicode
| Total characters | 20144 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Color |
|---|---|
| 2nd row | Color |
| 3rd row | Color |
| 4th row | Color |
| 5th row | Color |
Common Values
| Value | Count | Frequency (%) |
| Color | 3632 | |
| Black and White | 124 | 3.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| color | 3632 | |
| black | 124 | 3.1% |
| and | 124 | 3.1% |
| white | 124 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 7264 | |
| l | 3756 | |
| C | 3632 | |
| r | 3632 | |
| 372 | 1.8% | |
| a | 248 | 1.2% |
| B | 124 | 0.6% |
| c | 124 | 0.6% |
| k | 124 | 0.6% |
| n | 124 | 0.6% |
| Other values (6) | 744 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15892 | |
| Uppercase Letter | 3880 | 19.3% |
| Space Separator | 372 | 1.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 7264 | |
| l | 3756 | |
| r | 3632 | |
| a | 248 | 1.6% |
| c | 124 | 0.8% |
| k | 124 | 0.8% |
| n | 124 | 0.8% |
| d | 124 | 0.8% |
| h | 124 | 0.8% |
| i | 124 | 0.8% |
| Other values (2) | 248 | 1.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3632 | |
| B | 124 | 3.2% |
| W | 124 | 3.2% |
Space Separator
| Value | Count | Frequency (%) |
| 372 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19772 | |
| Common | 372 | 1.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 7264 | |
| l | 3756 | |
| C | 3632 | |
| r | 3632 | |
| a | 248 | 1.3% |
| B | 124 | 0.6% |
| c | 124 | 0.6% |
| k | 124 | 0.6% |
| n | 124 | 0.6% |
| d | 124 | 0.6% |
| Other values (5) | 620 | 3.1% |
Common
| Value | Count | Frequency (%) |
| 372 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20144 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 7264 | |
| l | 3756 | |
| C | 3632 | |
| r | 3632 | |
| 372 | 1.8% | |
| a | 248 | 1.2% |
| B | 124 | 0.6% |
| c | 124 | 0.6% |
| k | 124 | 0.6% |
| n | 124 | 0.6% |
| Other values (6) | 744 | 3.7% |
| Distinct | 1659 |
|---|---|
| Distinct (%) | 44.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.5 KiB |
| Steven Spielberg | 25 |
|---|---|
| Clint Eastwood | 19 |
| Woody Allen | 19 |
| Ridley Scott | 17 |
| Martin Scorsese | 16 |
| Other values (1654) |
Length
| Max length | 32 |
|---|---|
| Median length | 24 |
| Mean length | 13.03541001 |
| Min length | 3 |
Characters and Unicode
| Total characters | 48961 |
|---|---|
| Distinct characters | 74 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 942 ? |
|---|---|
| Unique (%) | 25.1% |
Sample
| 1st row | James Cameron |
|---|---|
| 2nd row | Gore Verbinski |
| 3rd row | Sam Mendes |
| 4th row | Christopher Nolan |
| 5th row | Andrew Stanton |
Common Values
| Value | Count | Frequency (%) |
| Steven Spielberg | 25 | 0.7% |
| Clint Eastwood | 19 | 0.5% |
| Woody Allen | 19 | 0.5% |
| Ridley Scott | 17 | 0.5% |
| Martin Scorsese | 16 | 0.4% |
| Steven Soderbergh | 16 | 0.4% |
| Tim Burton | 16 | 0.4% |
| Spike Lee | 15 | 0.4% |
| Renny Harlin | 15 | 0.4% |
| Ron Howard | 13 | 0.3% |
| Other values (1649) | 3585 |
Length
| Value | Count | Frequency (%) |
| john | 147 | 1.9% |
| david | 116 | 1.5% |
| michael | 97 | 1.2% |
| peter | 75 | 1.0% |
| james | 69 | 0.9% |
| robert | 68 | 0.9% |
| paul | 66 | 0.8% |
| steven | 56 | 0.7% |
| richard | 55 | 0.7% |
| scott | 54 | 0.7% |
| Other values (2125) | 6995 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4751 | 9.7% |
| 4042 | 8.3% | |
| a | 3836 | 7.8% |
| n | 3551 | 7.3% |
| r | 3390 | 6.9% |
| o | 2922 | 6.0% |
| i | 2754 | 5.6% |
| l | 2227 | 4.5% |
| t | 1797 | 3.7% |
| s | 1569 | 3.2% |
| Other values (64) | 18122 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36692 | |
| Uppercase Letter | 7974 | 16.3% |
| Space Separator | 4042 | 8.3% |
| Other Punctuation | 191 | 0.4% |
| Dash Punctuation | 62 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4751 | |
| a | 3836 | |
| n | 3551 | |
| r | 3390 | 9.2% |
| o | 2922 | 8.0% |
| i | 2754 | 7.5% |
| l | 2227 | 6.1% |
| t | 1797 | 4.9% |
| s | 1569 | 4.3% |
| h | 1384 | 3.8% |
| Other values (29) | 8511 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 781 | 9.8% |
| J | 703 | 8.8% |
| M | 670 | 8.4% |
| R | 590 | 7.4% |
| C | 538 | 6.7% |
| B | 503 | 6.3% |
| D | 461 | 5.8% |
| A | 415 | 5.2% |
| L | 381 | 4.8% |
| P | 380 | 4.8% |
| Other values (21) | 2552 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 176 | |
| ' | 15 | 7.9% |
Space Separator
| Value | Count | Frequency (%) |
| 4042 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 62 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 44666 | |
| Common | 4295 | 8.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4751 | 10.6% |
| a | 3836 | 8.6% |
| n | 3551 | 8.0% |
| r | 3390 | 7.6% |
| o | 2922 | 6.5% |
| i | 2754 | 6.2% |
| l | 2227 | 5.0% |
| t | 1797 | 4.0% |
| s | 1569 | 3.5% |
| h | 1384 | 3.1% |
| Other values (60) | 16485 |
Common
| Value | Count | Frequency (%) |
| 4042 | ||
| . | 176 | 4.1% |
| - | 62 | 1.4% |
| ' | 15 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48870 | |
| None | 91 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 4751 | 9.7% |
| 4042 | 8.3% | |
| a | 3836 | 7.8% |
| n | 3551 | 7.3% |
| r | 3390 | 6.9% |
| o | 2922 | 6.0% |
| i | 2754 | 5.6% |
| l | 2227 | 4.6% |
| t | 1797 | 3.7% |
| s | 1569 | 3.2% |
| Other values (46) | 18031 |
None
| Value | Count | Frequency (%) |
| é | 23 | |
| á | 15 | |
| ö | 13 | |
| ó | 11 | |
| å | 6 | 6.6% |
| ñ | 5 | 5.5% |
| ç | 4 | 4.4% |
| í | 3 | 3.3% |
| Ô | 2 | 2.2% |
| æ | 1 | 1.1% |
| Other values (8) | 8 | 8.8% |
num_critic_for_reviews
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 525 |
|---|---|
| Distinct (%) | 14.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 167.378328 |
| Minimum | 2 |
|---|---|
| Maximum | 813 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 28 |
| Q1 | 77 |
| median | 138.5 |
| Q3 | 224 |
| 95-th percentile | 417 |
| Maximum | 813 |
| Range | 811 |
| Interquartile range (IQR) | 147 |
Descriptive statistics
| Standard deviation | 123.4520402 |
|---|---|
| Coefficient of variation (CV) | 0.7375628713 |
| Kurtosis | 2.529591949 |
| Mean | 167.378328 |
| Median Absolute Deviation (MAD) | 70.5 |
| Skewness | 1.424722665 |
| Sum | 628673 |
| Variance | 15240.40623 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 81 | 29 | 0.8% |
| 98 | 23 | 0.6% |
| 111 | 22 | 0.6% |
| 63 | 22 | 0.6% |
| 94 | 22 | 0.6% |
| 112 | 22 | 0.6% |
| 97 | 22 | 0.6% |
| 60 | 21 | 0.6% |
| 75 | 21 | 0.6% |
| 61 | 21 | 0.6% |
| Other values (515) | 3531 |
| Value | Count | Frequency (%) |
| 2 | 3 | 0.1% |
| 4 | 2 | 0.1% |
| 5 | 2 | 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 4 | |
| 9 | 8 | |
| 10 | 8 | |
| 11 | 7 | |
| 12 | 9 |
| Value | Count | Frequency (%) |
| 813 | 1 | |
| 775 | 1 | |
| 765 | 1 | |
| 750 | 2 | |
| 739 | 1 | |
| 738 | 1 | |
| 733 | 1 | |
| 723 | 1 | |
| 712 | 1 | |
| 703 | 2 |
| Distinct | 151 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 110.2579872 |
| Minimum | 37 |
|---|---|
| Maximum | 330 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 37 |
|---|---|
| 5-th percentile | 85 |
| Q1 | 96 |
| median | 106 |
| Q3 | 120 |
| 95-th percentile | 148 |
| Maximum | 330 |
| Range | 293 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 22.64671656 |
|---|---|
| Coefficient of variation (CV) | 0.2053975148 |
| Kurtosis | 12.62739704 |
| Mean | 110.2579872 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 2.402551754 |
| Sum | 414129 |
| Variance | 512.8737711 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 101 | 113 | 3.0% |
| 100 | 104 | 2.8% |
| 98 | 103 | 2.7% |
| 95 | 95 | 2.5% |
| 99 | 93 | 2.5% |
| 107 | 92 | 2.4% |
| 106 | 91 | 2.4% |
| 90 | 90 | 2.4% |
| 97 | 90 | 2.4% |
| 110 | 88 | 2.3% |
| Other values (141) | 2797 |
| Value | Count | Frequency (%) |
| 37 | 1 | < 0.1% |
| 45 | 1 | < 0.1% |
| 53 | 1 | < 0.1% |
| 63 | 1 | < 0.1% |
| 66 | 1 | < 0.1% |
| 68 | 2 | 0.1% |
| 69 | 1 | < 0.1% |
| 72 | 3 | |
| 73 | 2 | 0.1% |
| 74 | 5 |
| Value | Count | Frequency (%) |
| 330 | 1 | |
| 325 | 1 | |
| 300 | 1 | |
| 293 | 1 | |
| 289 | 1 | |
| 280 | 1 | |
| 271 | 1 | |
| 251 | 2 | |
| 240 | 1 | |
| 236 | 1 |
| Distinct | 395 |
|---|---|
| Distinct (%) | 10.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 807.3365282 |
| Minimum | 0 |
|---|---|
| Maximum | 23000 |
| Zeros | 642 |
| Zeros (%) | 17.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 11 |
| median | 64 |
| Q3 | 235 |
| 95-th percentile | 2000 |
| Maximum | 23000 |
| Range | 23000 |
| Interquartile range (IQR) | 224 |
Descriptive statistics
| Standard deviation | 3068.171683 |
|---|---|
| Coefficient of variation (CV) | 3.800362767 |
| Kurtosis | 22.25900267 |
| Mean | 807.3365282 |
| Median Absolute Deviation (MAD) | 64 |
| Skewness | 4.754529215 |
| Sum | 3032356 |
| Variance | 9413677.474 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 642 | 17.1% |
| 6 | 40 | 1.1% |
| 7 | 40 | 1.1% |
| 11 | 39 | 1.0% |
| 13 | 38 | 1.0% |
| 12 | 36 | 1.0% |
| 10 | 35 | 0.9% |
| 23 | 34 | 0.9% |
| 8 | 31 | 0.8% |
| 11000 | 31 | 0.8% |
| Other values (385) | 2790 |
| Value | Count | Frequency (%) |
| 0 | 642 | |
| 2 | 24 | 0.6% |
| 3 | 30 | 0.8% |
| 4 | 30 | 0.8% |
| 5 | 26 | 0.7% |
| 6 | 40 | 1.1% |
| 7 | 40 | 1.1% |
| 8 | 31 | 0.8% |
| 9 | 31 | 0.8% |
| 10 | 35 | 0.9% |
| Value | Count | Frequency (%) |
| 23000 | 1 | < 0.1% |
| 22000 | 8 | 0.2% |
| 21000 | 10 | 0.3% |
| 18000 | 4 | 0.1% |
| 17000 | 16 | |
| 16000 | 27 | |
| 15000 | 2 | 0.1% |
| 14000 | 29 | |
| 13000 | 19 | |
| 12000 | 17 |
actor_3_facebook_likes
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 882 |
|---|---|
| Distinct (%) | 23.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 771.2795527 |
| Minimum | 0 |
|---|---|
| Maximum | 23000 |
| Zeros | 27 |
| Zeros (%) | 0.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 24 |
| Q1 | 194 |
| median | 436 |
| Q3 | 691 |
| 95-th percentile | 1000 |
| Maximum | 23000 |
| Range | 23000 |
| Interquartile range (IQR) | 497 |
Descriptive statistics
| Standard deviation | 1894.249869 |
|---|---|
| Coefficient of variation (CV) | 2.455983518 |
| Kurtosis | 45.77112024 |
| Mean | 771.2795527 |
| Median Absolute Deviation (MAD) | 249 |
| Skewness | 6.370113892 |
| Sum | 2896926 |
| Variance | 3588182.567 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1000 | 116 | 3.1% |
| 11000 | 28 | 0.7% |
| 0 | 27 | 0.7% |
| 3000 | 26 | 0.7% |
| 2000 | 25 | 0.7% |
| 4000 | 17 | 0.5% |
| 10000 | 16 | 0.4% |
| 826 | 15 | 0.4% |
| 748 | 14 | 0.4% |
| 322 | 14 | 0.4% |
| Other values (872) | 3458 |
| Value | Count | Frequency (%) |
| 0 | 27 | |
| 2 | 8 | 0.2% |
| 3 | 8 | 0.2% |
| 4 | 13 | |
| 5 | 6 | 0.2% |
| 6 | 9 | 0.2% |
| 7 | 13 | |
| 8 | 9 | 0.2% |
| 9 | 6 | 0.2% |
| 10 | 5 | 0.1% |
| Value | Count | Frequency (%) |
| 23000 | 2 | 0.1% |
| 20000 | 1 | < 0.1% |
| 19000 | 5 | 0.1% |
| 17000 | 1 | < 0.1% |
| 16000 | 3 | 0.1% |
| 15000 | 1 | < 0.1% |
| 14000 | 6 | 0.2% |
| 13000 | 5 | 0.1% |
| 12000 | 8 | 0.2% |
| 11000 | 28 |
| Distinct | 2188 |
|---|---|
| Distinct (%) | 58.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.5 KiB |
| Morgan Freeman | 20 |
|---|---|
| Charlize Theron | 14 |
| Brad Pitt | 14 |
| James Franco | 11 |
| Meryl Streep | 10 |
| Other values (2183) |
Length
| Max length | 28 |
|---|---|
| Median length | 25 |
| Mean length | 13.08892439 |
| Min length | 3 |
Characters and Unicode
| Total characters | 49162 |
|---|---|
| Distinct characters | 78 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1460 ? |
|---|---|
| Unique (%) | 38.9% |
Sample
| 1st row | Joel David Moore |
|---|---|
| 2nd row | Orlando Bloom |
| 3rd row | Rory Kinnear |
| 4th row | Christian Bale |
| 5th row | Samantha Morton |
Common Values
| Value | Count | Frequency (%) |
| Morgan Freeman | 20 | 0.5% |
| Charlize Theron | 14 | 0.4% |
| Brad Pitt | 14 | 0.4% |
| James Franco | 11 | 0.3% |
| Meryl Streep | 10 | 0.3% |
| Jason Flemyng | 10 | 0.3% |
| Adam Sandler | 9 | 0.2% |
| Will Ferrell | 9 | 0.2% |
| Bruce Willis | 9 | 0.2% |
| Angelina Jolie Pitt | 9 | 0.2% |
| Other values (2178) | 3641 |
Length
| Value | Count | Frequency (%) |
| michael | 70 | 0.9% |
| tom | 41 | 0.5% |
| james | 41 | 0.5% |
| jason | 40 | 0.5% |
| david | 39 | 0.5% |
| scott | 39 | 0.5% |
| robert | 34 | 0.4% |
| john | 32 | 0.4% |
| adam | 31 | 0.4% |
| thomas | 31 | 0.4% |
| Other values (2926) | 7391 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4631 | 9.4% |
| a | 4364 | 8.9% |
| 4033 | 8.2% | |
| n | 3571 | 7.3% |
| r | 3269 | 6.6% |
| i | 3068 | 6.2% |
| o | 2750 | 5.6% |
| l | 2588 | 5.3% |
| t | 1748 | 3.6% |
| s | 1625 | 3.3% |
| Other values (68) | 17515 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36939 | |
| Uppercase Letter | 8000 | 16.3% |
| Space Separator | 4033 | 8.2% |
| Other Punctuation | 141 | 0.3% |
| Dash Punctuation | 45 | 0.1% |
| Decimal Number | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4631 | |
| a | 4364 | |
| n | 3571 | |
| r | 3269 | |
| i | 3068 | 8.3% |
| o | 2750 | 7.4% |
| l | 2588 | 7.0% |
| t | 1748 | 4.7% |
| s | 1625 | 4.4% |
| h | 1333 | 3.6% |
| Other values (36) | 7992 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 752 | 9.4% |
| C | 613 | 7.7% |
| S | 609 | 7.6% |
| B | 565 | 7.1% |
| J | 559 | 7.0% |
| D | 500 | 6.2% |
| R | 450 | 5.6% |
| A | 446 | 5.6% |
| L | 382 | 4.8% |
| T | 358 | 4.5% |
| Other values (16) | 2766 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 93 | |
| ' | 48 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 5 | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 4033 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 45 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 44939 | |
| Common | 4223 | 8.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4631 | 10.3% |
| a | 4364 | 9.7% |
| n | 3571 | 7.9% |
| r | 3269 | 7.3% |
| i | 3068 | 6.8% |
| o | 2750 | 6.1% |
| l | 2588 | 5.8% |
| t | 1748 | 3.9% |
| s | 1625 | 3.6% |
| h | 1333 | 3.0% |
| Other values (62) | 15992 |
Common
| Value | Count | Frequency (%) |
| 4033 | ||
| . | 93 | 2.2% |
| ' | 48 | 1.1% |
| - | 45 | 1.1% |
| 0 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49075 | |
| None | 87 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 4631 | 9.4% |
| a | 4364 | 8.9% |
| 4033 | 8.2% | |
| n | 3571 | 7.3% |
| r | 3269 | 6.7% |
| i | 3068 | 6.3% |
| o | 2750 | 5.6% |
| l | 2588 | 5.3% |
| t | 1748 | 3.6% |
| s | 1625 | 3.3% |
| Other values (48) | 17428 |
None
| Value | Count | Frequency (%) |
| é | 27 | |
| í | 13 | |
| á | 8 | 9.2% |
| ë | 6 | 6.9% |
| å | 5 | 5.7% |
| ø | 5 | 5.7% |
| ü | 3 | 3.4% |
| ö | 3 | 3.4% |
| ó | 3 | 3.4% |
| ï | 2 | 2.3% |
| Other values (10) | 12 |
actor_1_facebook_likes
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWED| Distinct | 713 |
|---|---|
| Distinct (%) | 19.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7751.338658 |
| Minimum | 0 |
|---|---|
| Maximum | 640000 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 210 |
| Q1 | 745 |
| median | 1000 |
| Q3 | 13000 |
| 95-th percentile | 26000 |
| Maximum | 640000 |
| Range | 640000 |
| Interquartile range (IQR) | 12255 |
Descriptive statistics
| Standard deviation | 15519.33962 |
|---|---|
| Coefficient of variation (CV) | 2.002149603 |
| Kurtosis | 757.7504015 |
| Mean | 7751.338658 |
| Median Absolute Deviation (MAD) | 942.5 |
| Skewness | 20.3394708 |
| Sum | 29114028 |
| Variance | 240849902.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1000 | 348 | 9.3% |
| 11000 | 190 | 5.1% |
| 2000 | 166 | 4.4% |
| 3000 | 137 | 3.6% |
| 12000 | 122 | 3.2% |
| 13000 | 112 | 3.0% |
| 14000 | 110 | 2.9% |
| 18000 | 104 | 2.8% |
| 10000 | 97 | 2.6% |
| 22000 | 71 | 1.9% |
| Other values (703) | 2299 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 2 | 5 | |
| 3 | 2 | 0.1% |
| 5 | 3 | |
| 6 | 2 | 0.1% |
| 7 | 2 | 0.1% |
| 9 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 15 | 2 | 0.1% |
| 17 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 640000 | 1 | < 0.1% |
| 260000 | 1 | < 0.1% |
| 164000 | 1 | < 0.1% |
| 137000 | 2 | 0.1% |
| 87000 | 7 | 0.2% |
| 49000 | 25 | |
| 46000 | 1 | < 0.1% |
| 45000 | 5 | 0.1% |
| 44000 | 2 | 0.1% |
| 40000 | 39 |
| Distinct | 3638 |
|---|---|
| Distinct (%) | 96.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52612824.24 |
| Minimum | 162 |
|---|---|
| Maximum | 760505847 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 162 |
|---|---|
| 5-th percentile | 196022.25 |
| Q1 | 8270232.75 |
| median | 30093107 |
| Q3 | 66881940.75 |
| 95-th percentile | 186762606.5 |
| Maximum | 760505847 |
| Range | 760505685 |
| Interquartile range (IQR) | 58611708 |
Descriptive statistics
| Standard deviation | 70317866.91 |
|---|---|
| Coefficient of variation (CV) | 1.336515725 |
| Kurtosis | 13.97005326 |
| Mean | 52612824.24 |
| Median Absolute Deviation (MAD) | 25209842 |
| Skewness | 3.029374712 |
| Sum | 1.976137678 × 1011 |
| Variance | 4.944602407 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 177343675 | 3 | 0.1% |
| 5773519 | 3 | 0.1% |
| 8000000 | 3 | 0.1% |
| 218051260 | 3 | 0.1% |
| 34964818 | 3 | 0.1% |
| 144512310 | 3 | 0.1% |
| 47000000 | 3 | 0.1% |
| 24343673 | 2 | 0.1% |
| 403932 | 2 | 0.1% |
| 58607007 | 2 | 0.1% |
| Other values (3628) | 3729 |
| Value | Count | Frequency (%) |
| 162 | 1 | |
| 703 | 1 | |
| 721 | 1 | |
| 1111 | 1 | |
| 1332 | 1 | |
| 2436 | 1 | |
| 2468 | 1 | |
| 2580 | 1 | |
| 2964 | 1 | |
| 3478 | 1 |
| Value | Count | Frequency (%) |
| 760505847 | 1 | |
| 658672302 | 1 | |
| 652177271 | 1 | |
| 623279547 | 2 | |
| 533316061 | 1 | |
| 474544677 | 1 | |
| 460935665 | 1 | |
| 458991599 | 1 | |
| 448130642 | 1 | |
| 436471036 | 1 |
| Distinct | 745 |
|---|---|
| Distinct (%) | 19.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.5 KiB |
| Comedy|Drama|Romance | 147 |
|---|---|
| Drama | 141 |
| Comedy|Drama | 138 |
| Comedy | 138 |
| Comedy|Romance | 131 |
| Other values (740) |
Length
| Max length | 64 |
|---|---|
| Median length | 52 |
| Mean length | 21.22018104 |
| Min length | 5 |
Characters and Unicode
| Total characters | 79703 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 393 ? |
|---|---|
| Unique (%) | 10.5% |
Sample
| 1st row | Action|Adventure|Fantasy|Sci-Fi |
|---|---|
| 2nd row | Action|Adventure|Fantasy |
| 3rd row | Action|Adventure|Thriller |
| 4th row | Action|Thriller |
| 5th row | Action|Adventure|Sci-Fi |
Common Values
| Value | Count | Frequency (%) |
| Comedy|Drama|Romance | 147 | 3.9% |
| Drama | 141 | 3.8% |
| Comedy|Drama | 138 | 3.7% |
| Comedy | 138 | 3.7% |
| Comedy|Romance | 131 | 3.5% |
| Drama|Romance | 115 | 3.1% |
| Crime|Drama|Thriller | 82 | 2.2% |
| Action|Crime|Thriller | 56 | 1.5% |
| Action|Crime|Drama|Thriller | 50 | 1.3% |
| Action|Adventure|Sci-Fi | 48 | 1.3% |
| Other values (735) | 2710 |
Length
| Value | Count | Frequency (%) |
| comedy|drama|romance | 147 | 3.9% |
| drama | 141 | 3.8% |
| comedy|drama | 138 | 3.7% |
| comedy | 138 | 3.7% |
| comedy|romance | 131 | 3.5% |
| drama|romance | 115 | 3.1% |
| crime|drama|thriller | 82 | 2.2% |
| action|crime|thriller | 56 | 1.5% |
| action|crime|drama|thriller | 50 | 1.3% |
| action|adventure|sci-fi | 48 | 1.3% |
| Other values (735) | 2710 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 7970 | 10.0% |
| | | 7480 | 9.4% |
| a | 6829 | 8.6% |
| e | 6255 | 7.8% |
| m | 5606 | 7.0% |
| i | 5248 | 6.6% |
| o | 4841 | 6.1% |
| y | 3611 | 4.5% |
| n | 3602 | 4.5% |
| t | 3228 | 4.1% |
| Other values (22) | 25033 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 59993 | |
| Uppercase Letter | 11733 | 14.7% |
| Math Symbol | 7480 | 9.4% |
| Dash Punctuation | 497 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 7970 | |
| a | 6829 | |
| e | 6255 | |
| m | 5606 | |
| i | 5248 | |
| o | 4841 | |
| y | 3611 | 6.0% |
| n | 3602 | 6.0% |
| t | 3228 | 5.4% |
| l | 2773 | 4.6% |
| Other values (8) | 10030 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2170 | |
| D | 1938 | |
| A | 1936 | |
| F | 1446 | |
| T | 1117 | |
| R | 859 | 7.3% |
| S | 644 | 5.5% |
| M | 631 | 5.4% |
| H | 541 | 4.6% |
| B | 239 | 2.0% |
| Other values (2) | 212 | 1.8% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 7480 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 497 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 71726 | |
| Common | 7977 | 10.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 7970 | 11.1% |
| a | 6829 | 9.5% |
| e | 6255 | 8.7% |
| m | 5606 | 7.8% |
| i | 5248 | 7.3% |
| o | 4841 | 6.7% |
| y | 3611 | 5.0% |
| n | 3602 | 5.0% |
| t | 3228 | 4.5% |
| l | 2773 | 3.9% |
| Other values (20) | 21763 |
Common
| Value | Count | Frequency (%) |
| | | 7480 | |
| - | 497 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 79703 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 7970 | 10.0% |
| | | 7480 | 9.4% |
| a | 6829 | 8.6% |
| e | 6255 | 7.8% |
| m | 5606 | 7.0% |
| i | 5248 | 6.6% |
| o | 4841 | 6.1% |
| y | 3611 | 4.5% |
| n | 3602 | 4.5% |
| t | 3228 | 4.1% |
| Other values (22) | 25033 |
| Distinct | 1428 |
|---|---|
| Distinct (%) | 38.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.5 KiB |
| Robert De Niro | 42 |
|---|---|
| Johnny Depp | 39 |
| J.K. Simmons | 31 |
| Nicolas Cage | 31 |
| Denzel Washington | 30 |
| Other values (1423) |
Length
| Max length | 27 |
|---|---|
| Median length | 24 |
| Mean length | 13.14882854 |
| Min length | 4 |
Characters and Unicode
| Total characters | 49387 |
|---|---|
| Distinct characters | 72 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 870 ? |
|---|---|
| Unique (%) | 23.2% |
Sample
| 1st row | CCH Pounder |
|---|---|
| 2nd row | Johnny Depp |
| 3rd row | Christoph Waltz |
| 4th row | Tom Hardy |
| 5th row | Daryl Sabara |
Common Values
| Value | Count | Frequency (%) |
| Robert De Niro | 42 | 1.1% |
| Johnny Depp | 39 | 1.0% |
| J.K. Simmons | 31 | 0.8% |
| Nicolas Cage | 31 | 0.8% |
| Denzel Washington | 30 | 0.8% |
| Bruce Willis | 29 | 0.8% |
| Matt Damon | 28 | 0.7% |
| Liam Neeson | 26 | 0.7% |
| Robert Downey Jr. | 26 | 0.7% |
| Robin Williams | 25 | 0.7% |
| Other values (1418) | 3449 |
Length
| Value | Count | Frequency (%) |
| robert | 94 | 1.2% |
| tom | 84 | 1.1% |
| michael | 63 | 0.8% |
| de | 48 | 0.6% |
| jason | 46 | 0.6% |
| steve | 45 | 0.6% |
| will | 44 | 0.6% |
| niro | 42 | 0.5% |
| johnny | 41 | 0.5% |
| matt | 41 | 0.5% |
| Other values (2054) | 7238 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4658 | 9.4% |
| a | 4168 | 8.4% |
| 4030 | 8.2% | |
| n | 3600 | 7.3% |
| r | 3159 | 6.4% |
| i | 3148 | 6.4% |
| o | 2933 | 5.9% |
| l | 2493 | 5.0% |
| t | 1945 | 3.9% |
| s | 1775 | 3.6% |
| Other values (62) | 17478 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 37155 | |
| Uppercase Letter | 7982 | 16.2% |
| Space Separator | 4030 | 8.2% |
| Other Punctuation | 168 | 0.3% |
| Dash Punctuation | 50 | 0.1% |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4658 | |
| a | 4168 | |
| n | 3600 | |
| r | 3159 | 8.5% |
| i | 3148 | 8.5% |
| o | 2933 | 7.9% |
| l | 2493 | 6.7% |
| t | 1945 | 5.2% |
| s | 1775 | 4.8% |
| h | 1337 | 3.6% |
| Other values (29) | 7939 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 746 | 9.3% |
| M | 656 | 8.2% |
| C | 641 | 8.0% |
| S | 617 | 7.7% |
| D | 566 | 7.1% |
| B | 533 | 6.7% |
| R | 466 | 5.8% |
| H | 401 | 5.0% |
| W | 383 | 4.8% |
| L | 370 | 4.6% |
| Other values (17) | 2603 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 140 | |
| ' | 28 | 16.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 1 | |
| 0 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 4030 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 50 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 45137 | |
| Common | 4250 | 8.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4658 | 10.3% |
| a | 4168 | 9.2% |
| n | 3600 | 8.0% |
| r | 3159 | 7.0% |
| i | 3148 | 7.0% |
| o | 2933 | 6.5% |
| l | 2493 | 5.5% |
| t | 1945 | 4.3% |
| s | 1775 | 3.9% |
| h | 1337 | 3.0% |
| Other values (56) | 15921 |
Common
| Value | Count | Frequency (%) |
| 4030 | ||
| . | 140 | 3.3% |
| - | 50 | 1.2% |
| ' | 28 | 0.7% |
| 5 | 1 | < 0.1% |
| 0 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49332 | |
| None | 55 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 4658 | 9.4% |
| a | 4168 | 8.4% |
| 4030 | 8.2% | |
| n | 3600 | 7.3% |
| r | 3159 | 6.4% |
| i | 3148 | 6.4% |
| o | 2933 | 5.9% |
| l | 2493 | 5.1% |
| t | 1945 | 3.9% |
| s | 1775 | 3.6% |
| Other values (48) | 17423 |
None
| Value | Count | Frequency (%) |
| ë | 14 | |
| é | 14 | |
| á | 5 | 9.1% |
| å | 4 | 7.3% |
| í | 4 | 7.3% |
| ç | 3 | 5.5% |
| à | 2 | 3.6% |
| ü | 2 | 3.6% |
| ø | 2 | 3.6% |
| ö | 1 | 1.8% |
| Other values (4) | 4 | 7.3% |
| Distinct | 3655 |
|---|---|
| Distinct (%) | 97.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.5 KiB |
| Home | 3 |
|---|---|
| Pan | 3 |
| King Kong | 3 |
| Halloween | 3 |
| Victor Frankenstein | 3 |
| Other values (3650) |
Length
| Max length | 84 |
|---|---|
| Median length | 53 |
| Mean length | 16.19941427 |
| Min length | 2 |
Characters and Unicode
| Total characters | 60845 |
|---|---|
| Distinct characters | 89 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 3560 ? |
|---|---|
| Unique (%) | 94.8% |
Sample
| 1st row | Avatar |
|---|---|
| 2nd row | Pirates of the Caribbean: At World's End |
| 3rd row | Spectre |
| 4th row | The Dark Knight Rises |
| 5th row | John Carter |
Common Values
| Value | Count | Frequency (%) |
| Home | 3 | 0.1% |
| Pan | 3 | 0.1% |
| King Kong | 3 | 0.1% |
| Halloween | 3 | 0.1% |
| Victor Frankenstein | 3 | 0.1% |
| The Fast and the Furious | 3 | 0.1% |
| The Island | 2 | 0.1% |
| Dawn of the Dead | 2 | 0.1% |
| Around the World in 80 Days | 2 | 0.1% |
| Mercury Rising | 2 | 0.1% |
| Other values (3645) | 3730 |
Length
| Value | Count | Frequency (%) |
| the | 1204 | 11.6% |
| of | 353 | 3.4% |
| a | 132 | 1.3% |
| and | 104 | 1.0% |
| 2 | 97 | 0.9% |
| in | 97 | 0.9% |
| to | 71 | 0.7% |
| 60 | 0.6% | |
| man | 56 | 0.5% |
| on | 43 | 0.4% |
| Other values (3872) | 8130 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6591 | 10.8% | |
| e | 5842 | 9.6% |
| 3756 | 6.2% | |
| a | 3562 | 5.9% |
| o | 3445 | 5.7% |
| r | 3088 | 5.1% |
| n | 3064 | 5.0% |
| i | 2924 | 4.8% |
| t | 2828 | 4.6% |
| s | 2247 | 3.7% |
| Other values (79) | 23498 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 40252 | |
| Space Separator | 10347 | 17.0% |
| Uppercase Letter | 9087 | 14.9% |
| Other Punctuation | 677 | 1.1% |
| Decimal Number | 402 | 0.7% |
| Dash Punctuation | 69 | 0.1% |
| Currency Symbol | 3 | < 0.1% |
| Other Number | 2 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5842 | |
| a | 3562 | 8.8% |
| o | 3445 | 8.6% |
| r | 3088 | 7.7% |
| n | 3064 | 7.6% |
| i | 2924 | 7.3% |
| t | 2828 | 7.0% |
| s | 2247 | 5.6% |
| h | 2204 | 5.5% |
| l | 1885 | 4.7% |
| Other values (21) | 9163 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1302 | |
| S | 781 | 8.6% |
| M | 629 | 6.9% |
| B | 570 | 6.3% |
| D | 520 | 5.7% |
| C | 507 | 5.6% |
| A | 488 | 5.4% |
| H | 417 | 4.6% |
| L | 414 | 4.6% |
| W | 377 | 4.1% |
| Other values (17) | 3082 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 128 | |
| 3 | 74 | |
| 0 | 59 | |
| 1 | 56 | |
| 4 | 24 | 6.0% |
| 5 | 17 | 4.2% |
| 8 | 15 | 3.7% |
| 9 | 13 | 3.2% |
| 6 | 9 | 2.2% |
| 7 | 7 | 1.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 284 | |
| ' | 154 | |
| . | 98 | 14.5% |
| , | 49 | 7.2% |
| & | 47 | 6.9% |
| ! | 25 | 3.7% |
| ? | 12 | 1.8% |
| / | 7 | 1.0% |
| · | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 6591 | ||
| 3756 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 2 | |
| $ | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 | |
| [ | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 | |
| ] | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 69 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 49339 | |
| Common | 11506 | 18.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5842 | 11.8% |
| a | 3562 | 7.2% |
| o | 3445 | 7.0% |
| r | 3088 | 6.3% |
| n | 3064 | 6.2% |
| i | 2924 | 5.9% |
| t | 2828 | 5.7% |
| s | 2247 | 4.6% |
| h | 2204 | 4.5% |
| l | 1885 | 3.8% |
| Other values (48) | 18250 |
Common
| Value | Count | Frequency (%) |
| 6591 | ||
| 3756 | ||
| : | 284 | 2.5% |
| ' | 154 | 1.3% |
| 2 | 128 | 1.1% |
| . | 98 | 0.9% |
| 3 | 74 | 0.6% |
| - | 69 | 0.6% |
| 0 | 59 | 0.5% |
| 1 | 56 | 0.5% |
| Other values (21) | 237 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 57075 | |
| None | 3770 | 6.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6591 | 11.5% | |
| e | 5842 | 10.2% |
| a | 3562 | 6.2% |
| o | 3445 | 6.0% |
| r | 3088 | 5.4% |
| n | 3064 | 5.4% |
| i | 2924 | 5.1% |
| t | 2828 | 5.0% |
| s | 2247 | 3.9% |
| h | 2204 | 3.9% |
| Other values (69) | 21280 |
None
| Value | Count | Frequency (%) |
| 3756 | ||
| é | 4 | 0.1% |
| ¢ | 2 | 0.1% |
| ½ | 2 | 0.1% |
| è | 1 | < 0.1% |
| ñ | 1 | < 0.1% |
| á | 1 | < 0.1% |
| Æ | 1 | < 0.1% |
| · | 1 | < 0.1% |
| ü | 1 | < 0.1% |
num_voted_users
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 3674 |
|---|---|
| Distinct (%) | 97.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 105826.7327 |
| Minimum | 91 |
|---|---|
| Maximum | 1689764 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 91 |
|---|---|
| 5-th percentile | 3673.25 |
| Q1 | 19667 |
| median | 53973.5 |
| Q3 | 128602 |
| 95-th percentile | 385889 |
| Maximum | 1689764 |
| Range | 1689673 |
| Interquartile range (IQR) | 108935 |
Descriptive statistics
| Standard deviation | 152035.3993 |
|---|---|
| Coefficient of variation (CV) | 1.436644555 |
| Kurtosis | 19.98071987 |
| Mean | 105826.7327 |
| Median Absolute Deviation (MAD) | 41383 |
| Skewness | 3.650372696 |
| Sum | 397485208 |
| Variance | 2.311476264 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3665 | 3 | 0.1% |
| 7973 | 2 | 0.1% |
| 110486 | 2 | 0.1% |
| 5254 | 2 | 0.1% |
| 4288 | 2 | 0.1% |
| 23928 | 2 | 0.1% |
| 113068 | 2 | 0.1% |
| 28621 | 2 | 0.1% |
| 23023 | 2 | 0.1% |
| 12980 | 2 | 0.1% |
| Other values (3664) | 3735 |
| Value | Count | Frequency (%) |
| 91 | 1 | |
| 154 | 1 | |
| 241 | 1 | |
| 344 | 1 | |
| 397 | 1 | |
| 448 | 1 | |
| 449 | 1 | |
| 475 | 1 | |
| 480 | 1 | |
| 524 | 1 |
| Value | Count | Frequency (%) |
| 1689764 | 1 | |
| 1676169 | 1 | |
| 1468200 | 1 | |
| 1347461 | 1 | |
| 1324680 | 1 | |
| 1251222 | 1 | |
| 1238746 | 1 | |
| 1217752 | 1 | |
| 1215718 | 1 | |
| 1155770 | 1 |
cast_total_facebook_likes
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 3243 |
|---|---|
| Distinct (%) | 86.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11527.10197 |
| Minimum | 0 |
|---|---|
| Maximum | 656730 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 449 |
| Q1 | 1919.75 |
| median | 4059.5 |
| Q3 | 16240 |
| 95-th percentile | 41428.25 |
| Maximum | 656730 |
| Range | 656730 |
| Interquartile range (IQR) | 14320.25 |
Descriptive statistics
| Standard deviation | 19122.17691 |
|---|---|
| Coefficient of variation (CV) | 1.658888501 |
| Kurtosis | 369.5087978 |
| Mean | 11527.10197 |
| Median Absolute Deviation (MAD) | 3113.5 |
| Skewness | 12.89487374 |
| Sum | 43295795 |
| Variance | 365657649.6 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2730 | 4 | 0.1% |
| 1520 | 4 | 0.1% |
| 2323 | 4 | 0.1% |
| 2 | 4 | 0.1% |
| 1136 | 4 | 0.1% |
| 2990 | 4 | 0.1% |
| 2348 | 4 | 0.1% |
| 2486 | 4 | 0.1% |
| 1044 | 4 | 0.1% |
| 2321 | 4 | 0.1% |
| Other values (3233) | 3716 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 2 | 4 | |
| 4 | 1 | < 0.1% |
| 5 | 3 | |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 15 | 2 | |
| 28 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 656730 | 1 | |
| 303717 | 1 | |
| 263584 | 1 | |
| 140268 | 1 | |
| 137712 | 1 | |
| 120797 | 1 | |
| 108016 | 1 | |
| 106759 | 1 | |
| 103354 | 1 | |
| 101383 | 1 |
| Distinct | 2587 |
|---|---|
| Distinct (%) | 68.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.5 KiB |
| Steve Coogan | 8 |
|---|---|
| Ben Mendelsohn | 7 |
| Robert Duvall | 7 |
| Kirsten Dunst | 7 |
| Anne Hathaway | 7 |
| Other values (2582) |
Length
| Max length | 27 |
|---|---|
| Median length | 24 |
| Mean length | 13.05777423 |
| Min length | 3 |
Characters and Unicode
| Total characters | 49045 |
|---|---|
| Distinct characters | 78 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1886 ? |
|---|---|
| Unique (%) | 50.2% |
Sample
| 1st row | Wes Studi |
|---|---|
| 2nd row | Jack Davenport |
| 3rd row | Stephanie Sigman |
| 4th row | Joseph Gordon-Levitt |
| 5th row | Polly Walker |
Common Values
| Value | Count | Frequency (%) |
| Steve Coogan | 8 | 0.2% |
| Ben Mendelsohn | 7 | 0.2% |
| Robert Duvall | 7 | 0.2% |
| Kirsten Dunst | 7 | 0.2% |
| Anne Hathaway | 7 | 0.2% |
| Sam Shepard | 6 | 0.2% |
| Craig T. Nelson | 6 | 0.2% |
| Kevin Dunn | 6 | 0.2% |
| Stephen Root | 6 | 0.2% |
| Kevin Pollak | 6 | 0.2% |
| Other values (2577) | 3690 |
Length
| Value | Count | Frequency (%) |
| michael | 67 | 0.9% |
| james | 52 | 0.7% |
| david | 52 | 0.7% |
| john | 47 | 0.6% |
| robert | 37 | 0.5% |
| kevin | 36 | 0.5% |
| peter | 31 | 0.4% |
| tom | 31 | 0.4% |
| steve | 31 | 0.4% |
| scott | 29 | 0.4% |
| Other values (3336) | 7371 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4630 | 9.4% |
| a | 4442 | 9.1% |
| 4028 | 8.2% | |
| n | 3470 | 7.1% |
| r | 3123 | 6.4% |
| i | 2965 | 6.0% |
| o | 2677 | 5.5% |
| l | 2673 | 5.5% |
| t | 1754 | 3.6% |
| s | 1718 | 3.5% |
| Other values (68) | 17565 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36762 | |
| Uppercase Letter | 8010 | 16.3% |
| Space Separator | 4028 | 8.2% |
| Other Punctuation | 185 | 0.4% |
| Dash Punctuation | 58 | 0.1% |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4630 | |
| a | 4442 | |
| n | 3470 | |
| r | 3123 | 8.5% |
| i | 2965 | 8.1% |
| o | 2677 | 7.3% |
| l | 2673 | 7.3% |
| t | 1754 | 4.8% |
| s | 1718 | 4.7% |
| h | 1364 | 3.7% |
| Other values (33) | 7946 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 758 | 9.5% |
| S | 613 | 7.7% |
| J | 611 | 7.6% |
| B | 598 | 7.5% |
| C | 587 | 7.3% |
| R | 488 | 6.1% |
| D | 477 | 6.0% |
| A | 422 | 5.3% |
| L | 381 | 4.8% |
| H | 354 | 4.4% |
| Other values (19) | 2721 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 141 | |
| ' | 44 | 23.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 1 | |
| 0 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 4028 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 58 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 44772 | |
| Common | 4273 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4630 | 10.3% |
| a | 4442 | 9.9% |
| n | 3470 | 7.8% |
| r | 3123 | 7.0% |
| i | 2965 | 6.6% |
| o | 2677 | 6.0% |
| l | 2673 | 6.0% |
| t | 1754 | 3.9% |
| s | 1718 | 3.8% |
| h | 1364 | 3.0% |
| Other values (62) | 15956 |
Common
| Value | Count | Frequency (%) |
| 4028 | ||
| . | 141 | 3.3% |
| - | 58 | 1.4% |
| ' | 44 | 1.0% |
| 5 | 1 | < 0.1% |
| 0 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48950 | |
| None | 95 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 4630 | 9.5% |
| a | 4442 | 9.1% |
| 4028 | 8.2% | |
| n | 3470 | 7.1% |
| r | 3123 | 6.4% |
| i | 2965 | 6.1% |
| o | 2677 | 5.5% |
| l | 2673 | 5.5% |
| t | 1754 | 3.6% |
| s | 1718 | 3.5% |
| Other values (48) | 17470 |
None
| Value | Count | Frequency (%) |
| é | 33 | |
| á | 11 | 11.6% |
| í | 7 | 7.4% |
| ë | 7 | 7.4% |
| à | 6 | 6.3% |
| ü | 5 | 5.3% |
| ó | 5 | 5.3% |
| è | 4 | 4.2% |
| ø | 3 | 3.2% |
| ô | 2 | 2.1% |
| Other values (10) | 12 | 12.6% |
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.377263046 |
| Minimum | 0 |
|---|---|
| Maximum | 43 |
| Zeros | 1582 |
| Zeros (%) | 42.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.041540518 |
|---|---|
| Coefficient of variation (CV) | 1.482317067 |
| Kurtosis | 63.74365948 |
| Mean | 1.377263046 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 4.94941434 |
| Sum | 5173 |
| Variance | 4.167887687 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1582 | |
| 1 | 955 | |
| 2 | 533 | 14.2% |
| 3 | 294 | 7.8% |
| 4 | 163 | 4.3% |
| 5 | 76 | 2.0% |
| 6 | 57 | 1.5% |
| 8 | 32 | 0.9% |
| 7 | 30 | 0.8% |
| 9 | 11 | 0.3% |
| Other values (9) | 23 | 0.6% |
| Value | Count | Frequency (%) |
| 0 | 1582 | |
| 1 | 955 | |
| 2 | 533 | 14.2% |
| 3 | 294 | 7.8% |
| 4 | 163 | 4.3% |
| 5 | 76 | 2.0% |
| 6 | 57 | 1.5% |
| 7 | 30 | 0.8% |
| 8 | 32 | 0.9% |
| 9 | 11 | 0.3% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 15 | 4 | 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 12 | 3 | 0.1% |
| 11 | 5 | |
| 10 | 6 | |
| 9 | 11 |
| Distinct | 3656 |
|---|---|
| Distinct (%) | 97.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.5 KiB |
| alien friendship|alien invasion|australia|flying car|mother daughter relationship | 3 |
|---|---|
| 1940s|child hero|fantasy world|orphan|reference to peter pan | 3 |
| animal name in title|ape abducts a woman|gorilla|island|king kong | 3 |
| halloween|masked killer|michael myers|slasher|trick or treat | 3 |
| assistant|experiment|frankenstein|medical student|scientist | 3 |
| Other values (3651) |
Length
| Max length | 149 |
|---|---|
| Median length | 98 |
| Mean length | 52.49813632 |
| Min length | 6 |
Characters and Unicode
| Total characters | 197183 |
|---|---|
| Distinct characters | 42 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3562 ? |
|---|---|
| Unique (%) | 94.8% |
Sample
| 1st row | avatar|future|marine|native|paraplegic |
|---|---|
| 2nd row | goddess|marriage ceremony|marriage proposal|pirate|singapore |
| 3rd row | bomb|espionage|sequel|spy|terrorist |
| 4th row | deception|imprisonment|lawlessness|police officer|terrorist plot |
| 5th row | alien|american civil war|male nipple|mars|princess |
Common Values
| Value | Count | Frequency (%) |
| alien friendship|alien invasion|australia|flying car|mother daughter relationship | 3 | 0.1% |
| 1940s|child hero|fantasy world|orphan|reference to peter pan | 3 | 0.1% |
| animal name in title|ape abducts a woman|gorilla|island|king kong | 3 | 0.1% |
| halloween|masked killer|michael myers|slasher|trick or treat | 3 | 0.1% |
| assistant|experiment|frankenstein|medical student|scientist | 3 | 0.1% |
| eighteen wheeler|illegal street racing|truck|trucker|undercover cop | 3 | 0.1% |
| clone|environment|escape|island|lottery | 2 | 0.1% |
| mall|mayhem|nurse|rear entry sex|survival horror | 2 | 0.1% |
| 19th century|around the world|inventor|martial arts|train | 2 | 0.1% |
| autistic child|boy|child in danger|fbi|nsa | 2 | 0.1% |
| Other values (3646) | 3730 |
Length
| Value | Count | Frequency (%) |
| in | 213 | 1.6% |
| of | 171 | 1.3% |
| on | 157 | 1.2% |
| a | 151 | 1.1% |
| the | 148 | 1.1% |
| to | 126 | 0.9% |
| york | 93 | 0.7% |
| female | 80 | 0.6% |
| based | 78 | 0.6% |
| by | 68 | 0.5% |
| Other values (9008) | 12364 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 19076 | 9.7% |
| a | 15089 | 7.7% |
| | | 14958 | 7.6% |
| i | 14318 | 7.3% |
| r | 13997 | 7.1% |
| t | 12338 | 6.3% |
| n | 11994 | 6.1% |
| o | 11955 | 6.1% |
| s | 10231 | 5.2% |
| 9893 | 5.0% | |
| Other values (32) | 63334 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 171292 | |
| Math Symbol | 14958 | 7.6% |
| Space Separator | 9893 | 5.0% |
| Decimal Number | 856 | 0.4% |
| Other Punctuation | 182 | 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 19076 | |
| a | 15089 | 8.8% |
| i | 14318 | 8.4% |
| r | 13997 | 8.2% |
| t | 12338 | 7.2% |
| n | 11994 | 7.0% |
| o | 11955 | 7.0% |
| s | 10231 | 6.0% |
| l | 8540 | 5.0% |
| c | 7392 | 4.3% |
| Other values (16) | 46362 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 221 | |
| 0 | 195 | |
| 9 | 176 | |
| 2 | 50 | 5.8% |
| 8 | 49 | 5.7% |
| 5 | 40 | 4.7% |
| 7 | 38 | 4.4% |
| 6 | 31 | 3.6% |
| 3 | 30 | 3.5% |
| 4 | 26 | 3.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 112 | |
| ' | 70 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 14958 |
Space Separator
| Value | Count | Frequency (%) |
| 9893 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 171292 | |
| Common | 25891 | 13.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 19076 | |
| a | 15089 | 8.8% |
| i | 14318 | 8.4% |
| r | 13997 | 8.2% |
| t | 12338 | 7.2% |
| n | 11994 | 7.0% |
| o | 11955 | 7.0% |
| s | 10231 | 6.0% |
| l | 8540 | 5.0% |
| c | 7392 | 4.3% |
| Other values (16) | 46362 |
Common
| Value | Count | Frequency (%) |
| | | 14958 | |
| 9893 | ||
| 1 | 221 | 0.9% |
| 0 | 195 | 0.8% |
| 9 | 176 | 0.7% |
| . | 112 | 0.4% |
| ' | 70 | 0.3% |
| 2 | 50 | 0.2% |
| 8 | 49 | 0.2% |
| 5 | 40 | 0.2% |
| Other values (6) | 127 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 197183 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 19076 | 9.7% |
| a | 15089 | 7.7% |
| | | 14958 | 7.6% |
| i | 14318 | 7.3% |
| r | 13997 | 7.1% |
| t | 12338 | 6.3% |
| n | 11994 | 6.1% |
| o | 11955 | 6.1% |
| s | 10231 | 5.2% |
| 9893 | 5.0% | |
| Other values (32) | 63334 |
| Distinct | 3656 |
|---|---|
| Distinct (%) | 97.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.5 KiB |
| http://www.imdb.com/title/tt2224026/?ref_=fn_tt_tt_1 | 3 |
|---|---|
| http://www.imdb.com/title/tt3332064/?ref_=fn_tt_tt_1 | 3 |
| http://www.imdb.com/title/tt0360717/?ref_=fn_tt_tt_1 | 3 |
| http://www.imdb.com/title/tt0077651/?ref_=fn_tt_tt_1 | 3 |
| http://www.imdb.com/title/tt1976009/?ref_=fn_tt_tt_1 | 3 |
| Other values (3651) |
Length
| Max length | 52 |
|---|---|
| Median length | 52 |
| Mean length | 52 |
| Min length | 52 |
Characters and Unicode
| Total characters | 195312 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3562 ? |
|---|---|
| Unique (%) | 94.8% |
Sample
| 1st row | http://www.imdb.com/title/tt0499549/?ref_=fn_tt_tt_1 |
|---|---|
| 2nd row | http://www.imdb.com/title/tt0449088/?ref_=fn_tt_tt_1 |
| 3rd row | http://www.imdb.com/title/tt2379713/?ref_=fn_tt_tt_1 |
| 4th row | http://www.imdb.com/title/tt1345836/?ref_=fn_tt_tt_1 |
| 5th row | http://www.imdb.com/title/tt0401729/?ref_=fn_tt_tt_1 |
Common Values
| Value | Count | Frequency (%) |
| http://www.imdb.com/title/tt2224026/?ref_=fn_tt_tt_1 | 3 | 0.1% |
| http://www.imdb.com/title/tt3332064/?ref_=fn_tt_tt_1 | 3 | 0.1% |
| http://www.imdb.com/title/tt0360717/?ref_=fn_tt_tt_1 | 3 | 0.1% |
| http://www.imdb.com/title/tt0077651/?ref_=fn_tt_tt_1 | 3 | 0.1% |
| http://www.imdb.com/title/tt1976009/?ref_=fn_tt_tt_1 | 3 | 0.1% |
| http://www.imdb.com/title/tt0232500/?ref_=fn_tt_tt_1 | 3 | 0.1% |
| http://www.imdb.com/title/tt0399201/?ref_=fn_tt_tt_1 | 2 | 0.1% |
| http://www.imdb.com/title/tt0363547/?ref_=fn_tt_tt_1 | 2 | 0.1% |
| http://www.imdb.com/title/tt0327437/?ref_=fn_tt_tt_1 | 2 | 0.1% |
| http://www.imdb.com/title/tt0120749/?ref_=fn_tt_tt_1 | 2 | 0.1% |
| Other values (3646) | 3730 |
Length
| Value | Count | Frequency (%) |
| http://www.imdb.com/title/tt2224026/?ref_=fn_tt_tt_1 | 3 | 0.1% |
| http://www.imdb.com/title/tt0360717/?ref_=fn_tt_tt_1 | 3 | 0.1% |
| http://www.imdb.com/title/tt0077651/?ref_=fn_tt_tt_1 | 3 | 0.1% |
| http://www.imdb.com/title/tt1976009/?ref_=fn_tt_tt_1 | 3 | 0.1% |
| http://www.imdb.com/title/tt0232500/?ref_=fn_tt_tt_1 | 3 | 0.1% |
| http://www.imdb.com/title/tt3332064/?ref_=fn_tt_tt_1 | 3 | 0.1% |
| http://www.imdb.com/title/tt2058673/?ref_=fn_tt_tt_1 | 2 | 0.1% |
| http://www.imdb.com/title/tt1939659/?ref_=fn_tt_tt_1 | 2 | 0.1% |
| http://www.imdb.com/title/tt2053463/?ref_=fn_tt_tt_1 | 2 | 0.1% |
| http://www.imdb.com/title/tt1099212/?ref_=fn_tt_tt_1 | 2 | 0.1% |
| Other values (3646) | 3730 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 37560 | |
| / | 18780 | 9.6% |
| _ | 15024 | 7.7% |
| w | 11268 | 5.8% |
| e | 7512 | 3.8% |
| f | 7512 | 3.8% |
| . | 7512 | 3.8% |
| i | 7512 | 3.8% |
| m | 7512 | 3.8% |
| 1 | 7504 | 3.8% |
| Other values (21) | 67616 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 112680 | |
| Other Punctuation | 33804 | 17.3% |
| Decimal Number | 30048 | 15.4% |
| Connector Punctuation | 15024 | 7.7% |
| Math Symbol | 3756 | 1.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 37560 | |
| w | 11268 | 10.0% |
| e | 7512 | 6.7% |
| f | 7512 | 6.7% |
| i | 7512 | 6.7% |
| m | 7512 | 6.7% |
| r | 3756 | 3.3% |
| n | 3756 | 3.3% |
| h | 3756 | 3.3% |
| l | 3756 | 3.3% |
| Other values (5) | 18780 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7504 | |
| 0 | 5171 | |
| 2 | 2717 | 9.0% |
| 3 | 2385 | 7.9% |
| 4 | 2296 | 7.6% |
| 8 | 2136 | 7.1% |
| 9 | 2055 | 6.8% |
| 7 | 1994 | 6.6% |
| 6 | 1967 | 6.5% |
| 5 | 1823 | 6.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 18780 | |
| . | 7512 | 22.2% |
| ? | 3756 | 11.1% |
| : | 3756 | 11.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 15024 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 3756 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 112680 | |
| Common | 82632 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 18780 | |
| _ | 15024 | |
| . | 7512 | 9.1% |
| 1 | 7504 | 9.1% |
| 0 | 5171 | 6.3% |
| ? | 3756 | 4.5% |
| = | 3756 | 4.5% |
| : | 3756 | 4.5% |
| 2 | 2717 | 3.3% |
| 3 | 2385 | 2.9% |
| Other values (6) | 12271 |
Latin
| Value | Count | Frequency (%) |
| t | 37560 | |
| w | 11268 | 10.0% |
| e | 7512 | 6.7% |
| f | 7512 | 6.7% |
| i | 7512 | 6.7% |
| m | 7512 | 6.7% |
| r | 3756 | 3.3% |
| n | 3756 | 3.3% |
| h | 3756 | 3.3% |
| l | 3756 | 3.3% |
| Other values (5) | 18780 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 195312 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 37560 | |
| / | 18780 | 9.6% |
| _ | 15024 | 7.7% |
| w | 11268 | 5.8% |
| e | 7512 | 3.8% |
| f | 7512 | 3.8% |
| . | 7512 | 3.8% |
| i | 7512 | 3.8% |
| m | 7512 | 3.8% |
| 1 | 7504 | 3.8% |
| Other values (21) | 67616 |
num_user_for_reviews
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 940 |
|---|---|
| Distinct (%) | 25.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 336.8431842 |
| Minimum | 4 |
|---|---|
| Maximum | 5060 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 36 |
| Q1 | 110 |
| median | 210 |
| Q3 | 398.25 |
| 95-th percentile | 1044.5 |
| Maximum | 5060 |
| Range | 5056 |
| Interquartile range (IQR) | 288.25 |
Descriptive statistics
| Standard deviation | 411.2273684 |
|---|---|
| Coefficient of variation (CV) | 1.220827339 |
| Kurtosis | 22.48974413 |
| Mean | 336.8431842 |
| Median Absolute Deviation (MAD) | 122 |
| Skewness | 3.844203716 |
| Sum | 1265183 |
| Variance | 169107.9485 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50 | 18 | 0.5% |
| 126 | 17 | 0.5% |
| 26 | 17 | 0.5% |
| 100 | 16 | 0.4% |
| 162 | 16 | 0.4% |
| 181 | 16 | 0.4% |
| 88 | 15 | 0.4% |
| 84 | 15 | 0.4% |
| 194 | 15 | 0.4% |
| 132 | 15 | 0.4% |
| Other values (930) | 3596 |
| Value | Count | Frequency (%) |
| 4 | 1 | < 0.1% |
| 5 | 3 | |
| 6 | 2 | 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 6 | |
| 11 | 4 | |
| 12 | 5 | |
| 13 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 5060 | 1 | |
| 4667 | 1 | |
| 4144 | 1 | |
| 3646 | 1 | |
| 3597 | 1 | |
| 3516 | 1 | |
| 3400 | 1 | |
| 3286 | 1 | |
| 3189 | 1 | |
| 3054 | 1 |
| Distinct | 34 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.5 KiB |
| English | |
|---|---|
| French | 34 |
| Spanish | 23 |
| Mandarin | 15 |
| German | 10 |
| Other values (29) | 76 |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 6.99627263 |
| Min length | 4 |
Characters and Unicode
| Total characters | 26278 |
|---|---|
| Distinct characters | 40 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 15 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | English |
|---|---|
| 2nd row | English |
| 3rd row | English |
| 4th row | English |
| 5th row | English |
Common Values
| Value | Count | Frequency (%) |
| English | 3598 | |
| French | 34 | 0.9% |
| Spanish | 23 | 0.6% |
| Mandarin | 15 | 0.4% |
| German | 10 | 0.3% |
| Japanese | 10 | 0.3% |
| Cantonese | 7 | 0.2% |
| Italian | 7 | 0.2% |
| Portuguese | 5 | 0.1% |
| Hindi | 5 | 0.1% |
| Other values (24) | 42 | 1.1% |
Length
| Value | Count | Frequency (%) |
| english | 3598 | |
| french | 34 | 0.9% |
| spanish | 23 | 0.6% |
| mandarin | 15 | 0.4% |
| german | 10 | 0.3% |
| japanese | 10 | 0.3% |
| cantonese | 7 | 0.2% |
| italian | 7 | 0.2% |
| korean | 5 | 0.1% |
| portuguese | 5 | 0.1% |
| Other values (24) | 42 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 3766 | |
| i | 3685 | |
| h | 3666 | |
| s | 3655 | |
| g | 3611 | |
| l | 3610 | |
| E | 3598 | |
| a | 143 | 0.5% |
| e | 109 | 0.4% |
| r | 84 | 0.3% |
| Other values (30) | 351 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22522 | |
| Uppercase Letter | 3756 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 3766 | |
| i | 3685 | |
| h | 3666 | |
| s | 3655 | |
| g | 3611 | |
| l | 3610 | |
| a | 143 | 0.6% |
| e | 109 | 0.5% |
| r | 84 | 0.4% |
| c | 40 | 0.2% |
| Other values (11) | 153 | 0.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 3598 | |
| F | 35 | 0.9% |
| S | 23 | 0.6% |
| M | 17 | 0.5% |
| J | 10 | 0.3% |
| G | 10 | 0.3% |
| I | 9 | 0.2% |
| D | 8 | 0.2% |
| C | 8 | 0.2% |
| P | 8 | 0.2% |
| Other values (9) | 30 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 26278 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 3766 | |
| i | 3685 | |
| h | 3666 | |
| s | 3655 | |
| g | 3611 | |
| l | 3610 | |
| E | 3598 | |
| a | 143 | 0.5% |
| e | 109 | 0.4% |
| r | 84 | 0.3% |
| Other values (30) | 351 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26278 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 3766 | |
| i | 3685 | |
| h | 3666 | |
| s | 3655 | |
| g | 3611 | |
| l | 3610 | |
| E | 3598 | |
| a | 143 | 0.5% |
| e | 109 | 0.4% |
| r | 84 | 0.3% |
| Other values (30) | 351 | 1.3% |
| Distinct | 45 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.5 KiB |
| USA | |
|---|---|
| UK | |
| France | 101 |
| Germany | 80 |
| Canada | 59 |
| Other values (40) | 211 |
Length
| Max length | 14 |
|---|---|
| Median length | 3 |
| Mean length | 3.375665602 |
| Min length | 2 |
Characters and Unicode
| Total characters | 12679 |
|---|---|
| Distinct characters | 45 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 16 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | USA |
|---|---|
| 2nd row | USA |
| 3rd row | UK |
| 4th row | USA |
| 5th row | USA |
Common Values
| Value | Count | Frequency (%) |
| USA | 2987 | |
| UK | 318 | 8.5% |
| France | 101 | 2.7% |
| Germany | 80 | 2.1% |
| Canada | 59 | 1.6% |
| Australia | 39 | 1.0% |
| Spain | 21 | 0.6% |
| Japan | 15 | 0.4% |
| Hong Kong | 13 | 0.3% |
| China | 13 | 0.3% |
| Other values (35) | 110 | 2.9% |
Length
| Value | Count | Frequency (%) |
| usa | 2987 | |
| uk | 318 | 8.4% |
| france | 101 | 2.7% |
| germany | 81 | 2.1% |
| canada | 59 | 1.6% |
| australia | 39 | 1.0% |
| spain | 21 | 0.6% |
| japan | 15 | 0.4% |
| hong | 13 | 0.3% |
| kong | 13 | 0.3% |
| Other values (40) | 150 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 3305 | |
| A | 3034 | |
| S | 3019 | |
| a | 616 | 4.9% |
| n | 379 | 3.0% |
| K | 339 | 2.7% |
| r | 273 | 2.2% |
| e | 262 | 2.1% |
| i | 120 | 0.9% |
| c | 119 | 0.9% |
| Other values (35) | 1213 | 9.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10088 | |
| Lowercase Letter | 2550 | 20.1% |
| Space Separator | 41 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 616 | |
| n | 379 | |
| r | 273 | |
| e | 262 | |
| i | 120 | 4.7% |
| c | 119 | 4.7% |
| y | 98 | 3.8% |
| d | 93 | 3.6% |
| m | 93 | 3.6% |
| l | 91 | 3.6% |
| Other values (13) | 406 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 3305 | |
| A | 3034 | |
| S | 3019 | |
| K | 339 | 3.4% |
| F | 102 | 1.0% |
| G | 83 | 0.8% |
| C | 77 | 0.8% |
| I | 30 | 0.3% |
| N | 19 | 0.2% |
| H | 15 | 0.1% |
| Other values (11) | 65 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| 41 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12638 | |
| Common | 41 | 0.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 3305 | |
| A | 3034 | |
| S | 3019 | |
| a | 616 | 4.9% |
| n | 379 | 3.0% |
| K | 339 | 2.7% |
| r | 273 | 2.2% |
| e | 262 | 2.1% |
| i | 120 | 0.9% |
| c | 119 | 0.9% |
| Other values (34) | 1172 | 9.3% |
Common
| Value | Count | Frequency (%) |
| 41 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12679 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 3305 | |
| A | 3034 | |
| S | 3019 | |
| a | 616 | 4.9% |
| n | 379 | 3.0% |
| K | 339 | 2.7% |
| r | 273 | 2.2% |
| e | 262 | 2.1% |
| i | 120 | 0.9% |
| c | 119 | 0.9% |
| Other values (35) | 1213 | 9.6% |
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.5 KiB |
| R | |
|---|---|
| PG-13 | |
| PG | |
| G | 87 |
| Not Rated | 34 |
| Other values (7) | 61 |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 2.693556976 |
| Min length | 1 |
Characters and Unicode
| Total characters | 10117 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PG-13 |
|---|---|
| 2nd row | PG-13 |
| 3rd row | PG-13 |
| 4th row | PG-13 |
| 5th row | PG-13 |
Common Values
| Value | Count | Frequency (%) |
| R | 1700 | |
| PG-13 | 1308 | |
| PG | 566 | 15.1% |
| G | 87 | 2.3% |
| Not Rated | 34 | 0.9% |
| Unrated | 22 | 0.6% |
| Approved | 17 | 0.5% |
| X | 10 | 0.3% |
| NC-17 | 6 | 0.2% |
| Passed | 3 | 0.1% |
| Other values (2) | 3 | 0.1% |
Length
| Value | Count | Frequency (%) |
| r | 1700 | |
| pg-13 | 1308 | |
| pg | 566 | 14.9% |
| g | 87 | 2.3% |
| not | 34 | 0.9% |
| rated | 34 | 0.9% |
| unrated | 22 | 0.6% |
| approved | 17 | 0.4% |
| x | 10 | 0.3% |
| nc-17 | 6 | 0.2% |
| Other values (3) | 6 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 1962 | |
| P | 1878 | |
| R | 1734 | |
| - | 1314 | |
| 1 | 1314 | |
| 3 | 1308 | |
| t | 90 | 0.9% |
| e | 76 | 0.8% |
| d | 76 | 0.8% |
| a | 59 | 0.6% |
| Other values (14) | 306 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5671 | |
| Decimal Number | 2628 | |
| Dash Punctuation | 1314 | 13.0% |
| Lowercase Letter | 470 | 4.6% |
| Space Separator | 34 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 90 | |
| e | 76 | |
| d | 76 | |
| a | 59 | |
| o | 51 | |
| r | 39 | |
| p | 34 | 7.2% |
| n | 22 | 4.7% |
| v | 17 | 3.6% |
| s | 6 | 1.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1962 | |
| P | 1878 | |
| R | 1734 | |
| N | 40 | 0.7% |
| U | 22 | 0.4% |
| A | 17 | 0.3% |
| X | 10 | 0.2% |
| C | 6 | 0.1% |
| M | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1314 | |
| 3 | 1308 | |
| 7 | 6 | 0.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1314 |
Space Separator
| Value | Count | Frequency (%) |
| 34 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6141 | |
| Common | 3976 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 1962 | |
| P | 1878 | |
| R | 1734 | |
| t | 90 | 1.5% |
| e | 76 | 1.2% |
| d | 76 | 1.2% |
| a | 59 | 1.0% |
| o | 51 | 0.8% |
| N | 40 | 0.7% |
| r | 39 | 0.6% |
| Other values (9) | 136 | 2.2% |
Common
| Value | Count | Frequency (%) |
| - | 1314 | |
| 1 | 1314 | |
| 3 | 1308 | |
| 34 | 0.9% | |
| 7 | 6 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10117 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 1962 | |
| P | 1878 | |
| R | 1734 | |
| - | 1314 | |
| 1 | 1314 | |
| 3 | 1308 | |
| t | 90 | 0.9% |
| e | 76 | 0.8% |
| d | 76 | 0.8% |
| a | 59 | 0.6% |
| Other values (14) | 306 | 3.0% |
| Distinct | 359 |
|---|---|
| Distinct (%) | 9.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 46236849.64 |
| Minimum | 218 |
|---|---|
| Maximum | 1.22155 × 1010 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 218 |
|---|---|
| 5-th percentile | 1200000 |
| Q1 | 10000000 |
| median | 25000000 |
| Q3 | 50000000 |
| 95-th percentile | 140000000 |
| Maximum | 1.22155 × 1010 |
| Range | 1.221549978 × 1010 |
| Interquartile range (IQR) | 40000000 |
Descriptive statistics
| Standard deviation | 226010288.5 |
|---|---|
| Coefficient of variation (CV) | 4.888098784 |
| Kurtosis | 2278.421463 |
| Mean | 46236849.64 |
| Median Absolute Deviation (MAD) | 17000000 |
| Skewness | 44.17414414 |
| Sum | 1.736656072 × 1011 |
| Variance | 5.10806505 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20000000 | 157 | 4.2% |
| 30000000 | 134 | 3.6% |
| 15000000 | 132 | 3.5% |
| 40000000 | 130 | 3.5% |
| 25000000 | 126 | 3.4% |
| 35000000 | 117 | 3.1% |
| 10000000 | 105 | 2.8% |
| 50000000 | 100 | 2.7% |
| 60000000 | 90 | 2.4% |
| 12000000 | 76 | 2.0% |
| Other values (349) | 2589 |
| Value | Count | Frequency (%) |
| 218 | 1 | |
| 1100 | 1 | |
| 4500 | 1 | |
| 7000 | 2 | |
| 10000 | 2 | |
| 14000 | 1 | |
| 15000 | 1 | |
| 23000 | 1 | |
| 25000 | 2 | |
| 40000 | 2 |
| Value | Count | Frequency (%) |
| 1.22155 × 1010 | 1 | |
| 4200000000 | 1 | |
| 2500000000 | 1 | |
| 2400000000 | 1 | |
| 2127519898 | 1 | |
| 1100000000 | 1 | |
| 1000000000 | 1 | |
| 700000000 | 2 | |
| 553632000 | 1 | |
| 400000000 | 1 |
| Distinct | 74 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2002.976571 |
| Minimum | 1927 |
|---|---|
| Maximum | 2016 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 1927 |
|---|---|
| 5-th percentile | 1985 |
| Q1 | 1999 |
| median | 2004 |
| Q3 | 2010 |
| 95-th percentile | 2014 |
| Maximum | 2016 |
| Range | 89 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 9.888108209 |
|---|---|
| Coefficient of variation (CV) | 0.004936706876 |
| Kurtosis | 8.291659729 |
| Mean | 2002.976571 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -2.070019489 |
| Sum | 7523180 |
| Variance | 97.77468395 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2002 | 190 | 5.1% |
| 2006 | 189 | 5.0% |
| 2009 | 182 | 4.8% |
| 2008 | 182 | 4.8% |
| 2005 | 182 | 4.8% |
| 2004 | 181 | 4.8% |
| 2001 | 179 | 4.8% |
| 2010 | 168 | 4.5% |
| 2011 | 168 | 4.5% |
| 2013 | 163 | 4.3% |
| Other values (64) | 1972 |
| Value | Count | Frequency (%) |
| 1927 | 1 | |
| 1929 | 1 | |
| 1933 | 1 | |
| 1935 | 1 | |
| 1936 | 1 | |
| 1937 | 1 | |
| 1939 | 2 | |
| 1940 | 1 | |
| 1946 | 2 | |
| 1947 | 1 |
| Value | Count | Frequency (%) |
| 2016 | 59 | 1.6% |
| 2015 | 128 | |
| 2014 | 145 | |
| 2013 | 163 | |
| 2012 | 158 | |
| 2011 | 168 | |
| 2010 | 168 | |
| 2009 | 182 | |
| 2008 | 182 | |
| 2007 | 152 |
actor_2_facebook_likes
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 855 |
|---|---|
| Distinct (%) | 22.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2021.775825 |
| Minimum | 0 |
|---|---|
| Maximum | 137000 |
| Zeros | 11 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 63 |
| Q1 | 384.75 |
| median | 685.5 |
| Q3 | 976 |
| 95-th percentile | 12000 |
| Maximum | 137000 |
| Range | 137000 |
| Interquartile range (IQR) | 591.25 |
Descriptive statistics
| Standard deviation | 4544.908236 |
|---|---|
| Coefficient of variation (CV) | 2.247978326 |
| Kurtosis | 211.6549776 |
| Mean | 2021.775825 |
| Median Absolute Deviation (MAD) | 293.5 |
| Skewness | 9.010297621 |
| Sum | 7593790 |
| Variance | 20656190.87 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1000 | 278 | 7.4% |
| 11000 | 107 | 2.8% |
| 2000 | 90 | 2.4% |
| 3000 | 72 | 1.9% |
| 10000 | 45 | 1.2% |
| 13000 | 39 | 1.0% |
| 14000 | 39 | 1.0% |
| 826 | 32 | 0.9% |
| 4000 | 30 | 0.8% |
| 12000 | 28 | 0.7% |
| Other values (845) | 2996 |
| Value | Count | Frequency (%) |
| 0 | 11 | |
| 2 | 5 | |
| 3 | 6 | |
| 4 | 3 | 0.1% |
| 5 | 4 | 0.1% |
| 6 | 3 | 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 4 | 0.1% |
| 9 | 5 | |
| 10 | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 137000 | 1 | < 0.1% |
| 29000 | 1 | < 0.1% |
| 27000 | 2 | 0.1% |
| 25000 | 3 | 0.1% |
| 23000 | 6 | |
| 22000 | 11 | |
| 21000 | 3 | 0.1% |
| 20000 | 6 | |
| 19000 | 7 | |
| 18000 | 9 |
| Distinct | 74 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.465282215 |
| Minimum | 1.6 |
|---|---|
| Maximum | 9.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 1.6 |
|---|---|
| 5-th percentile | 4.6 |
| Q1 | 5.9 |
| median | 6.6 |
| Q3 | 7.2 |
| 95-th percentile | 8 |
| Maximum | 9.3 |
| Range | 7.7 |
| Interquartile range (IQR) | 1.3 |
Descriptive statistics
| Standard deviation | 1.056246753 |
|---|---|
| Coefficient of variation (CV) | 0.1633721032 |
| Kurtosis | 1.146984546 |
| Mean | 6.465282215 |
| Median Absolute Deviation (MAD) | 0.7 |
| Skewness | -0.7233267987 |
| Sum | 24283.6 |
| Variance | 1.115657204 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.7 | 176 | 4.7% |
| 6.6 | 162 | 4.3% |
| 6.5 | 154 | 4.1% |
| 6.4 | 146 | 3.9% |
| 6.8 | 143 | 3.8% |
| 6.1 | 142 | 3.8% |
| 7.1 | 140 | 3.7% |
| 7 | 140 | 3.7% |
| 7.2 | 139 | 3.7% |
| 6.9 | 135 | 3.6% |
| Other values (64) | 2279 |
| Value | Count | Frequency (%) |
| 1.6 | 1 | < 0.1% |
| 1.9 | 2 | 0.1% |
| 2 | 1 | < 0.1% |
| 2.1 | 3 | |
| 2.2 | 1 | < 0.1% |
| 2.3 | 3 | |
| 2.4 | 2 | 0.1% |
| 2.5 | 1 | < 0.1% |
| 2.7 | 4 | |
| 2.8 | 5 |
| Value | Count | Frequency (%) |
| 9.3 | 1 | < 0.1% |
| 9.2 | 1 | < 0.1% |
| 9 | 2 | 0.1% |
| 8.9 | 4 | 0.1% |
| 8.8 | 5 | 0.1% |
| 8.7 | 7 | 0.2% |
| 8.6 | 8 | 0.2% |
| 8.5 | 19 | |
| 8.4 | 14 | |
| 8.3 | 25 |
aspect_ratio
Real number (ℝ≥0)
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.111014377 |
| Minimum | 1.18 |
|---|---|
| Maximum | 16 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 1.18 |
|---|---|
| 5-th percentile | 1.85 |
| Q1 | 1.85 |
| median | 2.35 |
| Q3 | 2.35 |
| 95-th percentile | 2.35 |
| Maximum | 16 |
| Range | 14.82 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.3530679822 |
|---|---|
| Coefficient of variation (CV) | 0.1672503921 |
| Kurtosis | 636.4138366 |
| Mean | 2.111014377 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.01440583 |
| Sum | 7928.97 |
| Variance | 0.1246570001 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.35 | 1988 | |
| 1.85 | 1590 | |
| 1.37 | 48 | 1.3% |
| 1.66 | 39 | 1.0% |
| 1.78 | 34 | 0.9% |
| 1.33 | 18 | 0.5% |
| 2.2 | 11 | 0.3% |
| 2.39 | 11 | 0.3% |
| 2 | 3 | 0.1% |
| 2.4 | 3 | 0.1% |
| Other values (8) | 11 | 0.3% |
| Value | Count | Frequency (%) |
| 1.18 | 1 | < 0.1% |
| 1.33 | 18 | 0.5% |
| 1.37 | 48 | 1.3% |
| 1.5 | 1 | < 0.1% |
| 1.66 | 39 | 1.0% |
| 1.75 | 2 | 0.1% |
| 1.77 | 1 | < 0.1% |
| 1.78 | 34 | 0.9% |
| 1.85 | 1590 | |
| 2 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 16 | 1 | < 0.1% |
| 2.76 | 3 | 0.1% |
| 2.55 | 1 | < 0.1% |
| 2.4 | 3 | 0.1% |
| 2.39 | 11 | 0.3% |
| 2.35 | 1988 | |
| 2.24 | 1 | < 0.1% |
| 2.2 | 11 | 0.3% |
| 2 | 3 | 0.1% |
| 1.85 | 1590 |
| Distinct | 657 |
|---|---|
| Distinct (%) | 17.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9353.82934 |
| Minimum | 0 |
|---|---|
| Maximum | 349000 |
| Zeros | 1742 |
| Zeros (%) | 46.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 29.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 227 |
| Q3 | 11000 |
| 95-th percentile | 48000 |
| Maximum | 349000 |
| Range | 349000 |
| Interquartile range (IQR) | 11000 |
Descriptive statistics
| Standard deviation | 21462.88912 |
|---|---|
| Coefficient of variation (CV) | 2.294556416 |
| Kurtosis | 33.40142354 |
| Mean | 9353.82934 |
| Median Absolute Deviation (MAD) | 227 |
| Skewness | 4.516866747 |
| Sum | 35132983 |
| Variance | 460655609.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1742 | |
| 1000 | 84 | 2.2% |
| 11000 | 74 | 2.0% |
| 10000 | 68 | 1.8% |
| 12000 | 53 | 1.4% |
| 15000 | 50 | 1.3% |
| 13000 | 49 | 1.3% |
| 14000 | 46 | 1.2% |
| 16000 | 44 | 1.2% |
| 2000 | 38 | 1.0% |
| Other values (647) | 1508 |
| Value | Count | Frequency (%) |
| 0 | 1742 | |
| 12 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 26 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 30 | 3 | 0.1% |
| 32 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 42 | 1 | < 0.1% |
| 47 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 349000 | 1 | |
| 199000 | 1 | |
| 197000 | 1 | |
| 191000 | 1 | |
| 190000 | 1 | |
| 175000 | 1 | |
| 165000 | 1 | |
| 164000 | 1 | |
| 153000 | 1 | |
| 150000 | 1 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | color | director_name | num_critic_for_reviews | duration | director_facebook_likes | actor_3_facebook_likes | actor_2_name | actor_1_facebook_likes | gross | genres | actor_1_name | movie_title | num_voted_users | cast_total_facebook_likes | actor_3_name | facenumber_in_poster | plot_keywords | movie_imdb_link | num_user_for_reviews | language | country | content_rating | budget | title_year | actor_2_facebook_likes | imdb_score | aspect_ratio | movie_facebook_likes | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | Color | James Cameron | 723.0 | 178.0 | 0.0 | 855.0 | Joel David Moore | 1000.0 | 760505847.0 | Action|Adventure|Fantasy|Sci-Fi | CCH Pounder | Avatar | 886204 | 4834 | Wes Studi | 0.0 | avatar|future|marine|native|paraplegic | http://www.imdb.com/title/tt0499549/?ref_=fn_tt_tt_1 | 3054.0 | English | USA | PG-13 | 237000000.0 | 2009.0 | 936.0 | 7.9 | 1.78 | 33000 |
| 1 | 1 | Color | Gore Verbinski | 302.0 | 169.0 | 563.0 | 1000.0 | Orlando Bloom | 40000.0 | 309404152.0 | Action|Adventure|Fantasy | Johnny Depp | Pirates of the Caribbean: At World's End | 471220 | 48350 | Jack Davenport | 0.0 | goddess|marriage ceremony|marriage proposal|pirate|singapore | http://www.imdb.com/title/tt0449088/?ref_=fn_tt_tt_1 | 1238.0 | English | USA | PG-13 | 300000000.0 | 2007.0 | 5000.0 | 7.1 | 2.35 | 0 |
| 2 | 2 | Color | Sam Mendes | 602.0 | 148.0 | 0.0 | 161.0 | Rory Kinnear | 11000.0 | 200074175.0 | Action|Adventure|Thriller | Christoph Waltz | Spectre | 275868 | 11700 | Stephanie Sigman | 1.0 | bomb|espionage|sequel|spy|terrorist | http://www.imdb.com/title/tt2379713/?ref_=fn_tt_tt_1 | 994.0 | English | UK | PG-13 | 245000000.0 | 2015.0 | 393.0 | 6.8 | 2.35 | 85000 |
| 3 | 3 | Color | Christopher Nolan | 813.0 | 164.0 | 22000.0 | 23000.0 | Christian Bale | 27000.0 | 448130642.0 | Action|Thriller | Tom Hardy | The Dark Knight Rises | 1144337 | 106759 | Joseph Gordon-Levitt | 0.0 | deception|imprisonment|lawlessness|police officer|terrorist plot | http://www.imdb.com/title/tt1345836/?ref_=fn_tt_tt_1 | 2701.0 | English | USA | PG-13 | 250000000.0 | 2012.0 | 23000.0 | 8.5 | 2.35 | 164000 |
| 4 | 5 | Color | Andrew Stanton | 462.0 | 132.0 | 475.0 | 530.0 | Samantha Morton | 640.0 | 73058679.0 | Action|Adventure|Sci-Fi | Daryl Sabara | John Carter | 212204 | 1873 | Polly Walker | 1.0 | alien|american civil war|male nipple|mars|princess | http://www.imdb.com/title/tt0401729/?ref_=fn_tt_tt_1 | 738.0 | English | USA | PG-13 | 263700000.0 | 2012.0 | 632.0 | 6.6 | 2.35 | 24000 |
| 5 | 6 | Color | Sam Raimi | 392.0 | 156.0 | 0.0 | 4000.0 | James Franco | 24000.0 | 336530303.0 | Action|Adventure|Romance | J.K. Simmons | Spider-Man 3 | 383056 | 46055 | Kirsten Dunst | 0.0 | sandman|spider man|symbiote|venom|villain | http://www.imdb.com/title/tt0413300/?ref_=fn_tt_tt_1 | 1902.0 | English | USA | PG-13 | 258000000.0 | 2007.0 | 11000.0 | 6.2 | 2.35 | 0 |
| 6 | 7 | Color | Nathan Greno | 324.0 | 100.0 | 15.0 | 284.0 | Donna Murphy | 799.0 | 200807262.0 | Adventure|Animation|Comedy|Family|Fantasy|Musical|Romance | Brad Garrett | Tangled | 294810 | 2036 | M.C. Gainey | 1.0 | 17th century|based on fairy tale|disney|flower|tower | http://www.imdb.com/title/tt0398286/?ref_=fn_tt_tt_1 | 387.0 | English | USA | PG | 260000000.0 | 2010.0 | 553.0 | 7.8 | 1.85 | 29000 |
| 7 | 8 | Color | Joss Whedon | 635.0 | 141.0 | 0.0 | 19000.0 | Robert Downey Jr. | 26000.0 | 458991599.0 | Action|Adventure|Sci-Fi | Chris Hemsworth | Avengers: Age of Ultron | 462669 | 92000 | Scarlett Johansson | 4.0 | artificial intelligence|based on comic book|captain america|marvel cinematic universe|superhero | http://www.imdb.com/title/tt2395427/?ref_=fn_tt_tt_1 | 1117.0 | English | USA | PG-13 | 250000000.0 | 2015.0 | 21000.0 | 7.5 | 2.35 | 118000 |
| 8 | 9 | Color | David Yates | 375.0 | 153.0 | 282.0 | 10000.0 | Daniel Radcliffe | 25000.0 | 301956980.0 | Adventure|Family|Fantasy|Mystery | Alan Rickman | Harry Potter and the Half-Blood Prince | 321795 | 58753 | Rupert Grint | 3.0 | blood|book|love|potion|professor | http://www.imdb.com/title/tt0417741/?ref_=fn_tt_tt_1 | 973.0 | English | UK | PG | 250000000.0 | 2009.0 | 11000.0 | 7.5 | 2.35 | 10000 |
| 9 | 10 | Color | Zack Snyder | 673.0 | 183.0 | 0.0 | 2000.0 | Lauren Cohan | 15000.0 | 330249062.0 | Action|Adventure|Sci-Fi | Henry Cavill | Batman v Superman: Dawn of Justice | 371639 | 24450 | Alan D. Purwin | 0.0 | based on comic book|batman|sequel to a reboot|superhero|superman | http://www.imdb.com/title/tt2975590/?ref_=fn_tt_tt_1 | 3018.0 | English | USA | PG-13 | 250000000.0 | 2016.0 | 4000.0 | 6.9 | 2.35 | 197000 |
Last rows
| df_index | color | director_name | num_critic_for_reviews | duration | director_facebook_likes | actor_3_facebook_likes | actor_2_name | actor_1_facebook_likes | gross | genres | actor_1_name | movie_title | num_voted_users | cast_total_facebook_likes | actor_3_name | facenumber_in_poster | plot_keywords | movie_imdb_link | num_user_for_reviews | language | country | content_rating | budget | title_year | actor_2_facebook_likes | imdb_score | aspect_ratio | movie_facebook_likes | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3746 | 5008 | Black and White | Kevin Smith | 136.0 | 102.0 | 0.0 | 216.0 | Brian O'Halloran | 898.0 | 3151130.0 | Comedy | Jason Mewes | Clerks | 181749 | 2103 | Jeff Anderson | 4.0 | clerk|friend|hockey|video|video store | http://www.imdb.com/title/tt0109445/?ref_=fn_tt_tt_1 | 615.0 | English | USA | R | 230000.0 | 1994.0 | 657.0 | 7.8 | 1.37 | 0 |
| 3747 | 5011 | Color | Neil LaBute | 80.0 | 97.0 | 119.0 | 7.0 | Matt Malloy | 136.0 | 2856622.0 | Comedy|Drama | Stacy Edwards | In the Company of Men | 11550 | 254 | Jason Dixie | 0.0 | business trip|love|misogynist|office|secretary | http://www.imdb.com/title/tt0119361/?ref_=fn_tt_tt_1 | 197.0 | English | Canada | R | 25000.0 | 1997.0 | 108.0 | 7.3 | 1.85 | 489 |
| 3748 | 5012 | Color | David Ayer | 233.0 | 109.0 | 453.0 | 120.0 | Martin Donovan | 1000.0 | 10499968.0 | Action|Crime|Drama|Thriller | Mireille Enos | Sabotage | 47502 | 1458 | Maurice Compte | 3.0 | dea|drug cartel|kicked in the crotch|strip club|tough girl | http://www.imdb.com/title/tt1742334/?ref_=fn_tt_tt_1 | 212.0 | English | USA | R | 35000000.0 | 2014.0 | 206.0 | 5.7 | 1.85 | 10000 |
| 3749 | 5015 | Black and White | Richard Linklater | 61.0 | 100.0 | 0.0 | 0.0 | Richard Linklater | 5.0 | 1227508.0 | Comedy|Drama | Tommy Pallotta | Slacker | 15103 | 5 | Jean Caffeine | 0.0 | austin texas|moon|pap smear|texas|twenty something | http://www.imdb.com/title/tt0102943/?ref_=fn_tt_tt_1 | 80.0 | English | USA | R | 23000.0 | 1991.0 | 0.0 | 7.1 | 1.37 | 2000 |
| 3750 | 5025 | Color | John Waters | 73.0 | 108.0 | 0.0 | 105.0 | Mink Stole | 462.0 | 180483.0 | Comedy|Crime|Horror | Divine | Pink Flamingos | 16792 | 760 | Edith Massey | 2.0 | absurd humor|egg|gross out humor|lesbian|sex | http://www.imdb.com/title/tt0069089/?ref_=fn_tt_tt_1 | 183.0 | English | USA | NC-17 | 10000.0 | 1972.0 | 143.0 | 6.1 | 1.37 | 0 |
| 3751 | 5026 | Color | Olivier Assayas | 81.0 | 110.0 | 107.0 | 45.0 | Béatrice Dalle | 576.0 | 136007.0 | Drama|Music|Romance | Maggie Cheung | Clean | 3924 | 776 | Don McKellar | 1.0 | jail|junkie|money|motel|singer | http://www.imdb.com/title/tt0388838/?ref_=fn_tt_tt_1 | 39.0 | French | France | R | 4500.0 | 2004.0 | 133.0 | 6.9 | 2.35 | 171 |
| 3752 | 5027 | Color | Jafar Panahi | 64.0 | 90.0 | 397.0 | 0.0 | Nargess Mamizadeh | 5.0 | 673780.0 | Drama | Fereshteh Sadre Orafaiy | The Circle | 4555 | 5 | Mojgan Faramarzi | 0.0 | abortion|bus|hospital|prison|prostitution | http://www.imdb.com/title/tt0255094/?ref_=fn_tt_tt_1 | 26.0 | Persian | Iran | Not Rated | 10000.0 | 2000.0 | 0.0 | 7.5 | 1.85 | 697 |
| 3753 | 5033 | Color | Shane Carruth | 143.0 | 77.0 | 291.0 | 8.0 | David Sullivan | 291.0 | 424760.0 | Drama|Sci-Fi|Thriller | Shane Carruth | Primer | 72639 | 368 | Casey Gooden | 0.0 | changing the future|independent film|invention|nonlinear timeline|time travel | http://www.imdb.com/title/tt0390384/?ref_=fn_tt_tt_1 | 371.0 | English | USA | PG-13 | 7000.0 | 2004.0 | 45.0 | 7.0 | 1.85 | 19000 |
| 3754 | 5035 | Color | Robert Rodriguez | 56.0 | 81.0 | 0.0 | 6.0 | Peter Marquardt | 121.0 | 2040920.0 | Action|Crime|Drama|Romance|Thriller | Carlos Gallardo | El Mariachi | 52055 | 147 | Consuelo Gómez | 0.0 | assassin|death|guitar|gun|mariachi | http://www.imdb.com/title/tt0104815/?ref_=fn_tt_tt_1 | 130.0 | Spanish | USA | R | 7000.0 | 1992.0 | 20.0 | 6.9 | 1.37 | 0 |
| 3755 | 5042 | Color | Jon Gunn | 43.0 | 90.0 | 16.0 | 16.0 | Brian Herzlinger | 86.0 | 85222.0 | Documentary | John August | My Date with Drew | 4285 | 163 | Jon Gunn | 0.0 | actress name in title|crush|date|four word title|video camera | http://www.imdb.com/title/tt0378407/?ref_=fn_tt_tt_1 | 84.0 | English | USA | PG | 1100.0 | 2004.0 | 23.0 | 6.6 | 1.85 | 456 |